Implementation and evaluation of a text-to-speech synthesis system for Turkish

Authors

Özgül Salor-Durna

Bryan L. Pellom

Mübeccel Demirekler

Abstract

In this paper, a diphone-based Text-to-Speech (TTS) system for the Turkish language is presented. Turkish is the official language of Turkey, where it is the native language of 70 million people, and it is also widely spoken in Asia (Azerbaijan, Uzbekistan, Kazakhstan, Kyrgyzstan and Iran), Cyprus and the Balkans. The research was carried out during a visiting internship at CSLR (the Center for Spoken Language Research, University of Colorado at Boulder) as part of an ongoing collaboration between CSLR and the Department of Electrical and Electronics Engineering at METU (Middle East Technical University). The system is based on the Festival Speech Synthesis System. A diphone database has been designed for Turkish. Tools developed for quick diphone collection and segmentation are illustrated. The text analysis module and the methods used to determine segment durations and pitch contours are discussed in detail. A Diagnostic Rhyme Test (DRT) has been designed for Turkish to test the intelligibility of the output speech. In a test with 20 listeners, the resulting TTS system is found to be 86.5% intelligible on average. This is the first diphone-based Turkish TTS system whose intelligibility is reported. We also believe that this paper will help researchers building TTS voices, especially those working on agglutinative languages, since every step needed along the way is explained in detail.
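As a rough illustration of what "diphone-based" means, the sketch below splits a phone sequence into the phone-to-phone transition units that a diphone synthesizer concatenates. It is a generic example, not the authors' implementation: the function name `to_diphones`, the silence symbol `_`, and the plain-letter phone labels are all assumptions made for illustration and do not reflect the phone set or the Festival configuration used in the paper.

```python
from typing import List

# Hypothetical silence marker; the paper's actual phone set is not given here.
SILENCE = "_"


def to_diphones(phones: List[str], silence: str = SILENCE) -> List[str]:
    """Return the diphone units (adjacent phone pairs) covering an utterance.

    The sequence is padded with silence on both sides so that the
    transitions into the first phone and out of the last phone are
    also represented as units.
    """
    if not phones:
        return []
    padded = [silence] + list(phones) + [silence]
    # Pair every phone with its successor: n phones yield n + 1 diphones.
    return [f"{left}-{right}" for left, right in zip(padded, padded[1:])]


if __name__ == "__main__":
    # Toy example: the Turkish word "ev" (house) as two phones.
    print(to_diphones(["e", "v"]))
```

An utterance of n phones yields n + 1 diphones under this padding scheme, which is why a diphone database needs to cover phone pairs rather than individual phones.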

Similar resources

An implementation and evaluation of two diphone-based synthesizers for Turkish

This paper presents two diphone-based Turkish text-to-speech systems: the first system is realized inside the MBROLA project, a freely available multilingual speech synthesizer, and the second system is based on shape-invariant harmonic modeling. Both synthesizers use the same parametric representations of two diphone databases (male, female) obtained by processing speech data with a pitch async...

Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques

One interesting topic in the multimedia domain is enabling computers to produce speech. Speech synthesis gives the computer the human ability to produce speech. The data-based approach and the process-based approach are the two main approaches to speech synthesis. Each approach has its own challenges. Unit-selection speech synthesis and statistical parametr...

A Corpus-Based Concatenative Speech Synthesis System for Turkish

Speech synthesis is the process of converting written text into machine-generated synthetic speech. Concatenative speech synthesis systems form utterances by concatenating pre-recorded speech units. Corpus-based methods use a large inventory to select the units to be concatenated. In this paper, we design and develop an intelligible and natural sounding corpus-based concatenative speech synthes...

Steps and Method of Preparing Syllable and Diphone Speech Databases for a Persian Text-to-Speech System

Speech databases are part of concatenative text-to-speech synthesis systems. The phonetic quality of the databases plays a significant role in the naturalness of the synthesized speech. This paper introduces two speech databases for Persian, one syllable-based and one diphone-based, and examines how they were developed, their specifications, and their advantages relative to each other. ...

Design and Implementation of an Intelligent Part of Speech Generator

The aim of this paper is to report on an attempt to design and implement an intelligent system capable of generating the correct part of speech for a given sentence while the sentence is totally new to the system and not stored in any database available to the system. It follows the same steps a normal individual does to provide the correct parts of speech using a natural language processor. It...

Publication date: 2003
